Robustness of group delay representations for noisy speech signals

نویسندگان

Sree Hari Krishnan Parthasarathi

R. Padmanabhan

Hema A. Murthy

چکیده

This paper demonstrates the robustness of group delay based features to additive noise. First, we analytically show the robustness of group delay based representations. The analysis makes use of the fact that, for minimum-phase signals, the group delay function can be represented in terms of the cepstral coefficients of the log-magnitude spectrum. Such a representation results in the speech spectrum dominating over the noise spectrum, both at low and high SNRs. Further, we experimentally demonstrate the robustness of the representation on a voice activity detection (VAD) task, comparing a group delay based VAD algorithm with standard VAD methods as well as a magnitude-spectrum based method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Zeros of the z-transform (ZZT) representation and chirp group delay processing for the analysis of source and filter characteristics of speech signals

This study proposes a new spectral representation called the Zeros of Z-Transform (ZZT), which is an all-zero representation of the z-transform of the signal. In addition, new chirp group delay processing techniques are developed for analysis of resonances of a signal. The combination of the ZZT representation with the chirp group delay processing algorithms provides a useful domain to study re...

متن کامل

Robust pitch estimation in noisy speech using ZTW and group delay function

Identification of pitch for speech signals recorded in noisy environments is a fundamental and long persistent problem in speech research. Several time domain based techniques attempt to exploit the periodic nature of the waveform using autocorrelation function and its variants. Other set of techniques utilize the harmonic structure in the spectral domain to identify pitch values. Either of the...

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Speech Enhancement of Multiple Moving Sources Based on Subband Clustering Time-delay Estimation

A new robust blind microphone array method to enhance speech signals generated by multiple moving sources in a noisy environment is presented. This approach is based on a two-stage scheme. A subband clustering time-delay estimation algorithm is first used to localize the dominant speech sources. The speech enhancement is performed in a second stage, based on the acquired spatial information, by...

متن کامل

Speech Enhancement Through an Optimized Subspace Division Technique

The speech enhancement techniques are often employed to improve the quality and intelligibility of the noisy speech signals. This paper discusses a novel technique for speech enhancement which is based on Singular Value Decomposition. This implementation utilizes a Genetic Algorithm based optimization method for reducing the effects of environmental noises from the singular vectors as well as t...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

I. J. Speech Technology

دوره 14 شماره

صفحات -

تاریخ انتشار 2011

Robustness of group delay representations for noisy speech signals

نویسندگان

چکیده

منابع مشابه

Zeros of the z-transform (ZZT) representation and chirp group delay processing for the analysis of source and filter characteristics of speech signals

Robust pitch estimation in noisy speech using ZTW and group delay function

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Speech Enhancement of Multiple Moving Sources Based on Subband Clustering Time-delay Estimation

Speech Enhancement Through an Optimized Subspace Division Technique

عنوان ژورنال:

اشتراک گذاری